Compiling Regular Computations to Fine-Grained Linear Processor Arrays
نویسنده
چکیده
Fine-grained linear processor arrays are an important class of architectures for obtaining high performance on computationally intensive algorithms with large data sets, as found prevalently in digital signal processing and scientiic computing. The vast number of processing elements on these architectures provides a immense amount of potential parallelism but at the price of limited interconnect and ne-granularity, resulting in complex constraints on synthesis. In an attempt to overcome such restrictions, compilers must search for both iteration-level and instruction-level parallelism. This paper evaluates the iteration-level scheduling techniques of space-time mapping and loop transformation theory, and the instruction-level techniques of software pipelining and trace scheduling. In particular, the applicability of each technique to linear ne-grained processor arrays is discussed. The conclusion is that a combination of loop transformation theory, software pipelining, and space-time mapping provides a potentially powerful method for scheduling to this class of architectures.
منابع مشابه
Mapping Applications to Coarse-Grain Reconfigurable Architectures
Coarse-grained reconfigurable architectures (CGRAs) are capable of achieving both goals of high performance and flexibility. CGRAs not only improve performance by exploiting the features of repetitive computations, but also can adapt to diverse computations by dynamically changing configurations of an array of its internal processing elements (PEs) and their interconnections. Many CGRAs have be...
متن کاملEnergy-Efficient Sorting on a Many-Core Platform
As processors move from multi-core to many-core architectures, opportunities arise for energy-efficient enterprise computations, such as sorting, on large arrays of processors. This paper proposes three different energy-efficient sorting methods for the first phase of an external sort simulated on a varying sized fine-grained many-core processor arrays used as a co-processor to an Intel CPU, wh...
متن کاملFast Synthesis of Large Planar Arrays Using Active Element Pattern Method and Fine- Grained Parallel Micro-genetic Algorithm
A radiation pattern synthesis technique for large planar arrays with active element pattern (AEP) method and fine-grained parallel micro-genetic algorithm (FGPMGA) is presented. Based on the AEP method, the mutual coupling between array elements can be taken into account. Analysis problems of large rectangular and triangular grid planar arrays are divided into small linear array problems. And f...
متن کاملCompiling Array Expressions for Efficient Execution on Distributed-Memory Machines
Array statements are often used to express data-parallelism in scientiic languages such as Fortran 90 and High Performance Fortran. In compiling array statements for a distributed-memory machine, eecient generation of communication sets and local index sets is important. We show that for arrays distributed block-cyclically on multiple processors, the local memory access sequence and communicati...
متن کاملHow to Emulate Fine-grained Multithreading
Fine-grained multithreading can be used to hide longlatency operations encountered in parallel computers during remote memory access. Instead of using special processor hardware, the emulation of fine-grained multithreading on standard processor hardware is investigated. While emulation of coarse-grained multithreading is common in modern operating systems, in the fine-grained case research on ...
متن کامل